A text-to-speech engine can usually translate individual words to speech successfully. However, as soon as the engine speaks a sentence, the perceived quality of its translation decreases because the engine cannot correctly synthesize human prosody -- the inflection, accent, and timing of human speech. You can change a speaking voice by inserting commands in the text file.
Note: Before using control tags, review the Syntax Rules and Conventions.
The prosody of translated speech can be improved by using text-to-speech control tags to better simulate human speech. The following is a list of text-to-speech control tags that can be embedded in the source text to improve the prosody of text-to-speech translation:
(some speech engines may not support this tag) | |
(new for SAPI 4.0 -- some speech engines may not support this tag) | |
(some speech engines may not support this tag) |
RmW - Reading Mode Audible Pauses (new for SAPI 4.0 -- some speech engines may not support this tag) |
(new for SAPI 4.0 -- some speech engines may not support this tag) |
(new for SAPI 4.0 -- some speech engines may not support this tag) |
(some speech engines may not support this tag) |
(new for SAPI 4.0 -- some speech engines may not support this tag) |
(new for SAPI 4.0 -- some speech engines may not support this tag) | |
(new for SAPI 4.0) | |
(new for SAPI 4.0 -- some speech engines may not support this tag) |
(some speech engines may not support this tag) |
(some speech engines may not support this tag) |
|
(some speech engines may not support this tag) |
|